This Page Is Inserted by IFW Operations 
and is not a part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of 
the original documents submitted by the applicant. 

Defects in the images may include (but are not limited to): 

• BLACK BORDERS 

• TEXT CUT OFF AT TOP, BOTTOM OR SIDES 

• FADED TEXT 

• ILLEGIBLE TEXT 

• SKEWED/SLANTED IMAGES 

• COLORED PHOTOS 

• BLACK OR VERY BLACK AND WHITE DARK PHOTOS 

• GRAY SCALE DOCUMENTS 



IMAGES ARE BEST AVAILABLE COPY. 

As rescanning documents will not correct images, 
Please do not report the images to the 
Image Problem Mailbox. 



Vol 272. No. 24, Issue of June 13, pp. 16313-15322 1997 
Y01 * ' Printed in U. SA. 



The Journal of Biological Chemistry ' , «. , T 

C 1997 byThe American Society for Biochemistry and Molecular Biology, Inc. 

targeting of Endopeptidase 24.16 to Different Subcellular 
Compartments by Alternative Promoter Usage* 

(Received for publication, February 28, 1997) 

Akira Katot, Naoaki Sugiura§, Yphko Saruta, Takehiko Hosoiri, Hiroshi YasueH, 
and Shigehisa Hirosej • 



Endopeptidase 24.16 or mitochondrial oligopeptidase, 
abbreviated here as EP 24.16 (MOP), is a thiol- and metal- 
dependent oligopeptidase that is found in multiple in- 
tracellular compartments in mammalian cells. From an 
analysis of the corresponding gene, we found that the 
distribution of the enzyme to appropriate subcellular 
locations is achieved by the use of alternative sites for 
the initiation of transcription. The pig EP 24.16 (MOP) 
gene spans over 100 kilobases and is organized into 16 
exons. The core protein sequence is encoded by exons 

5- 16 which match perfectly with exons 2-13 of the gene 
for endopeptidase 24.15, another member of the thimet 
oligopeptidase family. These two sets of 11 exons share 
the same splice sites, suggesting a common ancestor. 
Multiple species of mRNA for EP 24.16 (MOP) were de- 
tected by the 6'-rapid amplification of cDNA ends and 
they were shown to have been generated from a single 
gene by alternative choices of sites for the initiation of 
transcription and splicing. Two types of transcript were 
prepared, corresponding to transcription from distal 
and proximal sites. Their expression in vitro in COS-1 
cells indicated that they encoded two isoforms (long and 
short) which differed only at their amino termini: the 
long form contained a cleavable mitochondrial target- 
ing sequence and was directed to mitochondria; the 
short form, lacking such a signal sequence, remained in 
the cytosol. The complex structure of the EP 24.16 (MOP) 
gene thus allows, by alternative promoter usage, a fine 
transcriptional regulation of coordinate expression, in 
the different subcellular compartments, of the two iso- 
forms arising from a single gene. 

Metalloendopeptidases form a large family of peptidases that 
have a His-Glu-X-X-His (HEXXH) zinc-binding motif and pref- 
erentially cleave short substrates. For example, endopeptidase 
24.15 (EP 1 24.15), a member of this family, acts on peptides of 

6- 18 amino acid residues and exhibits no or only very weak 
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proteolytic activity against proteins (1-5). Among the members 
of this family, thimet oligopeptidase (TOP or EP 24.15) and 
oligopeptidase M (MOP or EP 24.16) are unique in their sen- 
sitivities to thiol reagents and they constitute a subfamdy, the 
thimet (thiol- and metal-dependent) oligopeptidase subfamily. 
Recent molecular cloning revealed the presence of a cysteine 
residue unique to members of this subfamily near position 483. 
This residue is absent from the other members that exhibit no 
thiol dependence (4, 5). In addition to the members of this 
family of mammalian origin, certain oligopeptidases of micro- 
bial origin that belong to this family have also been identified, 
including oligopeptidase A (OpdA) and dipeptidyl carboxypep- 
tidase (Dcp) of Escherichia coli and Salmonella typhimunum 
(6) peptidase F of Lactococcus lactis (7), mitochondrial inter- 
mediate peptidase of rat and yeast (8, 9), and saccharolysin 
(YCL57w or proteinase yscD) of yeast (10). This report deals 
with the two best characterized mammalian enzymes, namely, 
EP 24 15 (TOP) and EP 24.16 (MOP), which are members of the 
thimet oligopeptidase family. This family has also been called 
the M3 family of metalloendopeptidases in the classification ot 
Rawlings and Barrett (11, 12). 

EP 24 15 (TOP) was first identified as a collagenase-like 
peptidase or Pz-peptidase in experiments with the Pz-peptide 
that was originally designed by Wiinsch and Heidnch (13) as a 
substrate for collagenase. Although the Pz-peptide was a good 
substrate for clostridial collagenase, it turned out not to be a 
substrate for avian and mammalian collagenases (14). The 
Pz-peptide hydrolyzing activities found in avian and mamma- 
lian tissues have, therefore, been designated collagenase-hke 
peptidases or simply Pz-peptidases. Independent studies on the 
metabolism of brain peptides led to the discovery of two en- 
zymes: one was described by Camargo et al. (15) in 1972 and 
was named neutral endopeptidase and, later, endo- 0 ligopepti- 
dase A; and the other, first described by Orlowski et al (16) in 
1983 was initially named soluble metalloendopeptidase and 
subsequently endopeptidase 24.15. All these enzymes turned 
out to be the same and are now known as thimet oligopeptidase 
(17) In this report we use the abbreviated designation br" 
24 15 (TOP) cDNA sequences for the mammalian enzyme are 
now available for the rat (4, 18, 19), pig (20), and human (21). 

EP 24 16 (MOP) was also discovered independently in sev- 
eral different laboratories. 1) Heidrich et al (22) demonstrated 
a Pz-peptide hydrolyzing activity in a mitochondrial fraction ot 
rat liver, which was later shown to be distinct from EP 24.1& 
(TOP) by both biochemical characterization (23) and partial 



' The abbreviations used are: TOP, thimet oligopeptidase; EP, en- 
dooeDtldase MOP, oligopeptidase M; PAGE, polyacrylanude gel elec- 

elements); RACE, rapid amplification of cDNA ends, PIPES, 1,4- 
piperazinediethanesulfonic acid; nt, nudeotide(s). 
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amino acid sequencing of the purified enzyme; it .was .named 
oligopeptidase M (24). 2) We (25) and Kiron and Soffer (26) 
identified a soluble angiotensin-binding protein in pig and rab- 
bit liver during the course of studies aimed at identifying 
hepatic receptors for angiotensin II. After our publication of the 
cDNA sequence of the binding protein from pig (27), McKie et 
al (18) pointed out the strong similarity between our sequence 
and that of rat EP 24.15 (TOP) which had been determined by 
Pierotti et al. (4, 19). We then obtained a second cDNA clone 
which was very similar to but clearly different from that of the 
cDNA for the binding protein, and we showed that ttie second 
clone represented the pig homolog of rat EP 24.15 (TOP) (20). 
The angiotensin-binding protein, although originally identified 
as a binding protein, did indeed have thiol- and metal-depend- 
ent oligopeptidase activity (20). At that time, therefore, the 
binding protein appeared to represent a new member of the 
thimet oligopeptidase family since the amino acid sequence of 
oligopeptidase M or EP 24.16 (MOP) from no mammalian spe- 
cies had yet been determined. 3) Kawabata et «Mf. 29 > 
isolated an endopeptidase and the corresponding cDNA clone 
as a candidate for an enzyme responsible for the post-transcrip- 
tional processing of rcarboxyglutamic acid-containing blood 
coagulation factors. They failed to notice the strong Mimtanty 
to our binding protein, which was later pointed out by McKie et 
al (18) 4) Checler et al. (20, 31) demonstrated the presence of 
a novel proteolytic activity capable of inactivating neurotensin. 
They purified the peptidase from rat brain synaptic mem- 
branes and characterized it (32). The enzyme termed neuroly- 
sis or endopeptidase 24.16, was shown to be distinct from EP 
24 15 (TOP) and neprilysin (also known as enkephalinase or 
endopeptidase 3.4,24.11) and to have a relatively broad sub- 
strate-specincity and tissue distribution; Recent determinat on 
of its amino acid sequence by cDNA cloning clearly mdicated 
that neurolysin is identical to the three enzymes mentioned 
above (33). Thus, four separate lines of research have con- 
verged in the discovery of a single new member of the thimet 
oligopeptidase family. In this report we use the abbreviation 
EP 24.16 (MOP) for this protein, whose identity has been only 
recently established. . 

EP 24.15 (TOP) and EP 24.16 (MOP) are very similar m 
terms of size and enzymatic properties: both are . intracellular 
proteins of 78-80 kDa, consisting of about 680-700 amino 
acids, and their sequences are 65% homologous (20). They are, 
however, clearly distinguishable in several respects. For exam- 
ple, Giey have different specificities for inhibitors drfferent 
immunoreactivity, and different cleavage-site specificities. EP 
24 15 (TOP) hydrolyzes neurotensin exclusively attheArg-Arg 
bond whereas EP 2T16 (MOP) cleaves it at the Pro-Tyr bond 
(16 24, 32)i Another difference is found in the subcellular 
localizations of these enzymes. EP 24.15 (TOP) is found m the 
cytosol while EP 24.16 (MOP) is found in both the cytosohc and 
mitochondrial compartments. How can the product of a sing e 
gene be localized to more than one intracellular compartment. 
To answer this question and to characterize evolutionary rela- 
tionships among the members of the thimet oligopeptidase 
family, we investigated the structural organization of the pig 
genes for EP 24.15 CTOP) and EP 24.16 (MOP) and of their 
i'-proximal flanking regions. We discovered six species of 
mRNA for EP 24.16 (MOP) that are generated from one single 
gene as a result of the utilization of alternative sites for the 
initiation of transcription. The six species of mRNA can be 
classified into two categories: those containing an additional 
sequence that encodes a mitochondrial targeting sequence and 
those that lack such a sequence. The use of different promoters 
for the eventual targeting of proteins to appropriate subceUular 
compartments appears to be a useful mechanism for adjust- 



ment of local concentrations of proteins that function at differ- 
ent intracellular sites in response to the physiological require- 

ments of the cell. 

EXPERIMENTAL PROCEDURES 

r AACOTCAGAOG-a') by T4 RNA ligase. A nested ■pedficpruiwrfer 
S£*nE * pol^sa chain reaction. The products of 'P*"-™ 

Pr< ? 6 ' ^t^ofDNA-m^ was sequenced by the dideoxy chain l ter- 

templates. Termination reactions were performed w ™ *eqm Itarm 
DNA polymerase (Epicentre Technologies Mad-on W»andmD« 
labeled M13 universal or reverse pnmer (LI-COR, . Lincoln, ™>-*™ 
ptaucts were analyzed with '^^^SS: 
Sequences were organized and analyzed with GENETYX-MAO pro 

nitraie in v 3*P-l a beled probe in a solution of 6 X SSPE U x 

folanude, 0.1% SDS and 6x Denhardfs solution at 42 'C to 16 r. 
The filters were rinsed twice at room temperature in 2 X SSC (1 X SfaU 

additional rounds of screening. e^v^vi <?/irT and 

FUm, Tokyo, Japan), nrimers namely, 108L 

^StCC^^GGTAOACCATGTGGGCTGTAGAA) for exon 
I finfi^TCTCTCCATGAGAATGCTCCT) for exon 3, »ere de- 
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otide kinase (Takara) and used for. the synthesis of probes. End-labeled 
primers were annealed with 6 >g of plasmid DNA that contained 
genomic fragments of the pig gene for EP 24.16 (MOP) CAj^MM 
837-bp fragment of APAB-G33 for exons 1 and 2; BgOl-EeoBI 923-bp 
fragment of APAB-G32 for exon 3); and ant is ease probes were synthe- 
sized with T7 DNA polymerase (Pharmacia, Uppsala, Sweden). The 
3 '-ends of the probes were digested with restriction enzymes (Smal for 
exon 1, BssHU for exon 2, and BglTl for exon 3), fractionated by elec- 
trophoresis on a 6% polyacrylamide gel that contained 7 M urea and 
exposed to x-ray film. Probes were detected as bands of the expected 
mobility and extracted in 0.6 M ammonium acetate, 10 mM magnesium 
acetate, 1 mM EDTA, 0.1% SDS, and 10 pgftnl yeast tRNA at 37 C for 
12 h. Extracted probes were precipitated in ethanol, and probes U x iu 
cpm each) were annealed with 6 ug of poly(A) + RNA from pig liver or 
with 10 ug of yeast tRNA, as a control, for 12 h at SO ft C in 80% 
formamide, 40 mM PIPES, pH 6.4, 1 mM EDTA, and 400 mM NaCl. 
Non-annealed nucleic acids were digested with S 1 nuclease (Boehnnger 
Mannheim, Mennheim, Germany) at a Hnal concentration of 1,000 
units/ml in 0.28 M NaCl, 0.05 M sodium acetate, pH 4.6, 4.6 mM ZnSU<, 
and 20 figftnl denatured herring sperm DNA- The protected fragments 
were purified by extraction with phenol/chloroform and precipitation 
with ethanol, and electrophoresed in 6% polyacrylamide gels containing 
7 M urea. Gels were dried and exposed to imaging plates for 48 n. 
Images were analyzed with the Bioimage Analyzer. 

Production of Antiserum against Purified Pig EP 24.16 (MOPy- 
Rabbits were injected subcutaneously with 75 y.g of purified pig br- 
24 16 (MOP) (formerly referred to as soluble angiotensin-binding pro- 
tein, sABP (27)) in complete Freund's afluvant. Booster injections with 
75 tie of purified protein in incomplete Freund's adjuvant were given 2, 
4, and 6 weeks after the initial injection. Rabbits were bled 10 days after 
the fourth injection. IC 

Construction and Expression ofcDNAs for Isoforms ofPigEP 24.1b 
(MOP)Six plasmids, pcDNA3-MOPl (exon 1-15-16]); -MOPl (exon 
1-4-15-16]); -MOP2 (exon 2-15-16]); -MOP2' (exon 2-4-15-16]); -MOP3 
(exon 3-15-16]); and -MOP3' (exon 3-4-[5-16]), were constructed for 
expression analysis. For pcDNA-MOPl, a 2732-bp EcoB.VEcoBI frag- 
ment of PAB-L1 (27), which contained the entire open reading frame of 
type 1 cDNA for EP 24.16 (MOP), was subdoned to pcDNA3 dnvitro- 
gen, San Diego, OA). For the other plasmids, PAB-R5, -R302, -KJ05, 
-R8, and -Rl (Fig. 1), which encoded only the 6'-ends of type 1 , 2, 2 , 3, 
and 3' cDNAs, respectively, were digested with A/wNI at their 3' 



termini, ligated with the 2291-bp Alwm-EcoBl fragment of PAB-L1 
and subcloned into pcDNA3. COS-1 cells were maintained in Didbeccos 
modified Eagle's medium (Life Technologies, Inc., Gaithersburg, MD) 
that contained 10 mM HEPES, pH 7.2, 10% fetal bovine serum, 60 
units/ml penicillin, and 50 jigftnl streptomycin, in a controlled atmos- 
phere of 6% C0 3 in air at 37 «C. Approximately 6 X 10* cells were 
electroporated with 20 M g of each plasmid at 220 V at a capacitance 
setting of 960 microfarads in a Gene Pulser apparatus (Bio-Rad) and 
harvested 48 h after electroporation. 

Subcellular Fractionation of Cells and Western Blotting-Ml steps 
were performed at 4 °C. Cells were washed by centrifugation m Dui- 
becco's phosphate-buffered saline (2.7 mM KC1, 138 mM NaCl 1.2 mM 
KH0PO4, and 8.1 mM Na 2 HP0 4( pH 7.4) at 700 rpm for 2 mm. Approx- 
imately 2 X 10 7 cells were suspended in 1 ml of 2.6 M sucrose and 
homogenized for 2 min. Nuclear fractions were removed by centrifuga- 
tion at 3,000 rpm (700 X g) for 10 min, and supematants were centri- 
fueed at 9,200 rpm (7,000 X g) for 10 min to recover mitochondrial 
fractions as pellets. Mitochondrial fractions were washed twice by cen- 
trifuiration at 25,000 rpm (24,000 X g) for 10 min. The post-mitocnon- 

min and the pellets (microsomes) and supematants (cytosol) were 
recovered. The concentration of protein in each fraction was determined 
with the BCA protein assay reagent (Pierce). Five ug of each protein 
sample were fractionated by SDS-PAGE (10% polyacrylamide) in stand- 
aid glycine runningbufTer (192 mM glycine, 25 mM Tns, and 0 1% SDS) 
or high-resolution running buffer (492 mM glycine, 75 mM Tns and 
0.1% SDS). The separated proteins were transferred to a P?Jyvmyhd e ne 
difluoride membrane (ATTO, Tokyo, Japan) and probed with 2,000-fold 
fliutedrabttt antiserum 

were detected with alkaline phosphatase-conjugated second antibodies 
with 4-nitro blue tetrazolium chloride and 5-bromo-4-chloro-3-uidolyl 
phosphate (NBT/BCIP) as chromogen. 

RESULTS 

Differences among 5' -Terminal Sequences of mRNAs for EP 
24 16 (MOP)— To delineate the complete structure of the gene 
for EP 24.16 (MOP), we determined the 6'-end of the corre- 
sponding mRNA by rapid amplification of 5'-ends of cDNA 
(5'-RACE) using preparations of poly(A) + RNA from pig liver. 
More than five prominent bands of fragments of 340-630 nu- 
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etructure'of cDNA for pig EP 24.16 (MOP) is described elsewhere (27). 

cleotides were obtained. DNA sequencing of these fragments 
revealed the presence of several mRNAs whose sequences were 
completely different from others starting 28 nt upstream from 
the ATG initiation codon (Fig. 1; M3). These results suggested 
that usage of alternative promoters and exons might be in- 
volved in the generation of the observed mRNA diversity. To 
determine the precise molecular mechanism responsible for 
generation of such heterologous mRNAs, we isolated and char- 
acterized the pig gene for EP 24.16 (MOP), which had previ- 
ously been shown to be present as only a single copy (27). 

Structural Organization and 5 1 -Untranslated Exons of EP 
24. 16 (MOP) Gene— To isolate the pig gene for EP 24.16 (MOP), 
we screened approximately 2 X 10 6 independent plaques of a 
pig genomic library in AEMBL3 (CLONTECH) using the 
PAB-L1 cDNA clone (27) as the probe. We isolated and mapped 
more than 60 clones, and then we subcloned and sequenced the 
phage fragments for the identification of exons. The exon- 
intron organization of the pig gene for EP 24.16 (MOP) was 
deduced from an analysis of 11 independent clones, each of 
which contained part of the gene (Fig. 2). The gene extends over 
100 kilobases and contains 16 exons and 15 introns of various 
sizes (Figs. 2B and 4B). All the introns have typical splice donor 
and acceptor boundaries (Fig. 4D) (35). 

Comparison of the nucleotide sequences of the genomic 
clones APAB-G32 and APAB-G33 (Fig. 2D) with those of the 
products of 5'-RACE (Fig. 1) allowed us to identify the alter- 
natively spliced leader exons (Fig. 2, A and B, and 6A). Six 
distinct species of mRNA for EP 24.16 (MOP) appeared to be 
generated by differential use of three sites for initiation of 
transcription located upstream of exons 1, 2, and 3, respec- 
tively, and by the alternative splicing of exon 4; exons 1, 2, and 
3 are mutually exclusive (Fig. 6A). Exon 1 encodes a putative 
mitochondrial targeting sequence, (M)IVRCLSAARRLHR (Fig. 
6D), which is rich in basic amino acids and can be expected to 
form an amphipathic helix (36). The common exons 5 through 
16 are used to assemble the functional domain of the enzyme. 
The zinc-binding motif HEFGH is encoded by exon 12 (Fig. 4B). 
The extreme 3' exon, exon 16, encodes the last 44 amino acid 
residues, the termination codon, and the 3'-untranslated se- 
quences that include three polyadenylation signals, a short 
interspersed repetitive element (SINE or PRE-1), and an AT 
repeat, all of which were identified previously by cDNA cloning 
(27). 

There appears to be a "pseudo-exon* that encodes a protein 
that resembles a ribosomal protein (11.5 kDa, L44 (37)) in 



XPABH-414 

F.O. 3. Structure of the pig gene for EP 24 IB (TOP). A , 
tural relationship to the cDNA. The structure of cDNA for pig EP 24.15 
(TOP) is described elsewhere (20). B, exon-intron organization C re- 
striction maps. The following abbreviations are used for the restriction 
enzymes: E, ffcoRI; S, Sort; X, Xbal. D, relative locations of genomic 
clones. 

reverse orientation (3' to 5') within the untranslated region of 
the 3'-most exon (Fig. 4B). The sequence encoding the h°mol°g 
of ribosomal L44 is flanked by the direct repeat TGTTCTA- 
GAGAATTT and has a poly(A) tract, suggesting that the pseu- 
dogene might have arisen as a result of retropositton. 

Structural Comparison of the Genes for EP 24.15 (TOP) and 
EP 24 16 (MOP)— We wondered whether the complexity of 
organization of the gene for EP 24.16 (MOP) might be reflected 
in the genes for other members of the thimet oligopeptide 
family and, to this end, we also characterized the genefer EP 
24 15 (TOP). The gene for EP 24.15 (TOP) was isolated from the 
same pig genomic DNA library as that used for isolation of 
eenes for EP 24.16 (MOP), and it was found to have a much 
simpler structure in its 5'-region (Figs. 3 and 4B). The gene 
exists as a single copy, as revealed by Southern blot analysis 
(data not shown); it spans approximately 45 kilobase pairs (Fig. 
4A); and it is organized into 13 exons. The overall organization 
of the two genes is very similar with the exception of the length 
of introns and the 5'-leader and untranslated exonsJFig. 4B). 
For example, exons 2-12 of the gene for EP 2415 (TOP) ^rre- 
spond precisely to exons 6-15 of the gene for EP 24.16 (MOP) 
and there is strong conservation of the respective exon-intron 
boundaries (Fig. 4D), suggesting evolution from a common 
ancestor. The zinc-binding motif HEFGH is encoded within 
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— flTttAAAfiAflgtartctgagtg 
—fiCCTTTCTAflgttifltcccttc 
—KtATCCAAOgtictgaggatc 
— TTBTICaaGgtgafltgattta 

- "61CAACACAS gtatgact tgta 

- - -OCTACTCCAG gtatglaaatac 
— eAATCCAfiAGgtattfltatttt 



tactgwtccagttUTflCA88- 
ccectflttteag TTCAIAflflAA- 
tttcgtgtctaiftAUASflTCC 
cttttCCtttaiAAAATTAAfiA- 
tcgctcaeacaoCAGKCrCCC- 
ctatt9atCca«8A«AACTfrft 
cetcgcooccagATGAGCTOGC 
gtecctctccag C6AJU88AAG- 
tcegteocatag fiCCWOTTCS- 
tctaccacceaj GCCTCTTCAA 
gct«ttcc*af. G6ACCAACAT 
ccototeggcag GUtSCATCC 



EP 24.15 (TOP) 

{axon 1) CCCCCAGCAftgtacctgcttcc 

■— (won 2) TCCTACACABoUagcctwge 

—(exon SJ—tTattTSCASglgcgtocggca 
-—(axon 4)-— AAaCAaSAflgtgfltgcecetB 
.—{exon 5)— -USCA6CT6Ggtatgtgatgt 

---(woo 6) fiTSCAAMA8gtgagggo«t«t 

—(exon 7)— wmcCTAOgtMOOceoteo 

(exon 8) TCTACCCGAGgtgtgt 

— (axon 9>— -AIBCTCCCABgtgggtfltggjc 
- — (exaftlQ)-— GCCAACACABgt geggc tgggc 
— (exonll)- — GCCACACCASgtagccacaetc 
— (axan12)"--«AertSCAAGgtg»ggagctgg 
* — (axan13) 



FlG.4.Comparisonofthestructuresofge M ^ 
24.16 (MOP) and EP 24.15 (TOP). The exons corresponding to the zmc-Wmg ^J^^^J^^^Z^^M^ f^ 
are aligned (dotted vertical line). B, comparison of the exon-intron «»™^ *SSST« 5^"S S to,a^ a^terminal extension 
are indicated by boxes and are numbered mbcldfype. ^f.^^^ifimOFU^ZcVZm^ indicate the 8 izes of exons and 
sequences generated from the alternative sites of initiation of translation of EF 24.16 (MUf '^ n ^^^T^ the fo _ Ep 34 i 6 (MOP) and 

A nucleotide sequences of exon-intron junctions. The exons are shown in capital letters and the introns mlowercase letters. The 5 -flanking regions 
of three sites of initiation of transcription are shown in italics. 



exon 9. The 3'-terminal exon 13 is composed of a short coding 
sequence, the termination codon, and the entire 3'-untrans- 
lated sequence. The promoter region of the gene for EP 24.15 
(TOP) lacks the TATA box but contains several putative bind- 
ing sites for ubiquitous factors including one CCAAT box, three 
Spl sites, one NF-1 site, one AP-1 site, and two AP-2 sites 
(data not shown). 

Identification of Three Major Sites of Transcription Initia- 
fwn.— Characterization of the 5'-ends of mRNAs for EP 24.16 
(MOP) by 6'-RACE (Fig. 1) revealed the presence of multiple 
sites for initiation of transcription, at least one each in the 
upstream of exons 1, 2, and 3. To determine the transcription 
start sites, we performed SI nuclease mapping using poly(A) + 
RftA from pig liver and using three probes, which were com- 
plementary to exons 1, 2, and 3, respectively. The locations of 
these probes are indicated in Fig. 5A. As we had expected, we 
found three sites (Fig. 5A): one located 172 nt upstream of the 
first Met (ATG) codon of exon 1; another located at 106 nt 
upstream of the Met codon of exon 2; and the third located 23 
nt downstream of the TATA box close to exon 3. 

Putative Sites for Binding of Transcription Factor Near Sites 
of Initiation of Transcription—Inspection of the sequence of 
5'-flanking regions of exons 1, 2, and 3, which we designated 



promoter regions PI, P2, and P3, respectively (Fig. 4B), re- 
vealed potential cw-acting DNA elements (Fig. 5, B and C). 
Promoter regions 1 and 2 are very GC-rich and lack the TATA 
and CAAT boxes that are typical of eukaryotic class H promot- 
ers; promoter 3 contains a conserved TATA box, which begins 
29 nt upstream of the previously identified 5'-end of exon 3. 
The sequences upstream of exons 1 and 2 contain several 
putative binding sites for transcription factors AP-2 and Spl. 
AP-2 mediates enhanced transcription as a result of stimula- 
tion by the protein kinase C, cAMP-dependent protein kinase 
A, and retinoic acid (38-40); Spl is a protein that binds to the 
GC box specifically and is often involved in the regulation of 
so-called housekeeping genes (41). The upstream region of exon 
3 includes consensus binding sites for the transcription factors 
Myb (product of the myeloblastosis oncogene), AP-1, and 
GATA-1. The presence of multiple binding sites for hematopoi- 
esis-specific factors is intriguing: Myb has been demonstrated 
to be important in the control of the proliferation and differen- 
tiation of hematopoietic cells (42), while GATA-1 was originally 
found as an erythroid-specific factor (43). The Myb-binding site 
immediately downstream of the TATA box for exon 3 is of 
particular interest since such a juxtaposed arrangement of a 
TATA box and a Myb target sequence was recently demon- 
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tctg^tatcagceecttetgtggtgtl^^^ 
ctflttcttiaccttetccaaagcccglfe 
atggctttbctcttgctgaattc 

Fto. 6. Identification of site of initiation of transcription of the pig gene for EP 24 16 (MOPT by SI 
sequences of promoter regions (B and C). A, 6 « of poly(A)* RNA from pig liver (lanes 2, 4, and 6) or yeast tRNA (lanes 1, 3, and 5) were 
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strated to serve as a Myb-suppressible promoter (44). 

Differential Subcellular Localization ofEP 24.16 (MOP) Di- 
rected by Alternatively Generated Speces of mRNA— The results 
described above suggest that the organization of the 5'-region 
of the genes for EP 24.16 (MOP) is unusually complex and that 
six mRNA species with different 5' termini are generated as a 
consequence of the use of separate promoters (Fig. 5) and the 
splicing of the 5'-leader exons 1 through 3 (in a mutually 
exclusive manner) and of exon 4. The cDNA sequences corre- 
sponding to the six species of mRNA are shown schematically 
in Fig. 6A, and they were used for the expression experiments 
described below. It should be noted that exon 1 has an in-frame 
ATG codon (designated Ml), when connected directly to exon 5, 
and the open reading frame in exon 1 encodes a putative signal 
peptide for import into mitochondria; exon 2 also has an in- 
frame ATG codon (M2) in an appropriate context for the initi- 
ation of translation (Fig. 1) (45) and the open reading frame 
predicts an enzyme with 64 more amino acids at its amino 
terminus than the product generated by the open reading 
frame that starts with an ATG codon (M3) in the common exon 
5 (Figs. 1 and 6A). The fact that exon 1 could encode an 
amino-terminal leader sequence for targeting to mitochondria 
strongly suggests that, upon selection or elimination of the 
sequence of exon 1 via differential utilization of the multiple 
promoters, the subcellular localization of the products of the 
gene for EP 24.16 (MOP) is strictly and efficiently controlled. 
To confirm this possibility, we carried out the following 
experiments. 

The six cDNA constructs depicted on the right side of Fig. 6A 
(labeled types 1 through 3 and 1' through 3') were inserted 
separately into the mammalian expression vector pcDNA3 and 
used to trarisfect COS-1 cells. Then subcellular organelles were 
isolated from the transfectants arid the levels of EP 24.16 
(MOP) in these organelles were examined by Western blotting 
(Fig. 6, B and C). The type 1 (1-15—161) construct directed the 
synthesis of EP 24.16 (MOP) that was targeted to mitochondria 
(Fig. 6, B, lane 3, and C, MOP(Mlb)); the mitochondrial enzyme 
was slightly smaller than the unprocessed precursor that re- 
mained, as a consequence of overexpression of the protein, in 
the cytosol (Fig. 6, B, lane 2, and C, MOP(Mla)). This difference 
in size indicates that the amino-terminal mitochondrial target- 
ing sequence is cleaved after translocation of the protein into 
mitochondria. Type 1' (1-4-15-16]), in which the connection 
between exons 1 and 5 is interrupted by insertion of exon 4 
which includes a stop codon (Fig. 6£>), yielded only the cytosolic 
form of EP 24.16 (MOP) generated from the ATG initiation 
codon (M3) in exon 5 (Fig. 6B, lanes 5-8). Type 2 (2-[5-16J) 
allowed the synthesis of an amino-terminally extended cytoso- 
lic form (Fig;. 6B, lane 10, upper band). Again, as seen with type 
2' (2-4-15-161), insertion of exon 4 generated a stop codon and 
only the short cytosolic form was expressed (Fig. 6B, lanes 
9-16). With type 1 and type 2, products of translation from the 
ATG codon in exon 5 (M3) were also detected (Fig. 6, B, lane 10, 
lower band, and C, MOP(M3)\ suggesting that these mRNAs 
generate two isoforms of the protein by alternative usage of 
codons for the initiation of translation (Ml and M3 for type 1 
and M2 and M3 for type 2). The constructs having exon 3 as the 
5'-leader exon (types 3 and 3') produced only the cytosolic form 



of the enzyme (Fig. 6B, lanes 17-24), as expected from the fact 
that exon 3 contains no in-frame ATG codon. 

DISCUSSION 

In this study, we demonstrated the heterogeneity at the 
5'-end of the mRNA for EP 24.16 (MOP). Moreover, we showed 
that the heterogeneity is generated by alternative usage of 
promoters and splicing of multiple 5 '-leader and untranslated 
exons and that it is responsible for the differential subcellular 
localization of the products of translation. 

Targeting of Proteins to Different Subcellular Locations by 
Alternative Usage of Promoters: EP 24.16 (MOP) Represents the 
First Example of Such a Mechanism for Intracellular Pepti- 
dases /Proteinases— Proteins, after their synthesis, must be de- 
livered to their sites of action. Delivery is usually accomplished 
with the help of terminal or internal targeting sequences. Se- 
quences for the targeting proteins to the following sites have 
been identified: mitochondria, endoplasmic reticulum, lyso- 
somes, nuclei, and peroxisomes. 

The presence of a putative mitochondrial targeting sequence 
at the amino terminus of the precursor to EP 24.16 (MOP) was 
first deduced by Serizawa et al (24) from the potential ability 
of this sequence to form an amphipathic a-helix with a hydro- 
phobic and a positively charged face of the type expected for a 
mitochondrial leader sequence (36, 46). This scenario explains 
the presence of EP 24.16 (MOP) in mitochondria. The enzyme 
is, however, known also to be present in the cytosol and, prior 
to the present study, the mechanism responsible for this dis- 
tribution of EP 24.16 (MOP) has remained unclear. Discovery 
of 5'-end variants of the mRNA for EP 24.16 (MOP) by the 
5'-RACE technique led us to investigate the genetic basis for 
such diversity. Through an analysis of the structure of the 
gene, which led to the identification of the three 5'-leader exons 
that are selected, in a mutually exclusive manner, by use of 
alternative promoters and splicing, we provided the following 
resolution of this problem (Fig. 7). If promoter 1 is used, the 
mitochondrial isoform of EP 24.16 (MOP) is generated by splic- 
ing of exon 1, which has a sequence that encodes a signal for 
transport to mitochondria, to exon 5, which is the beginning of 
the common translated region that encodes the mature portion 
of the protein (type 1 in Fig. 6, A and JD). The precursor form 
(704 amino acid residues) with the mitochondrial targeting 
sequence is processed to the mature mitochondrial form of 667 
residues (Fig. 7). The type 1 transcript can also yield the 
cytosolic form of 681 amino acids when the M3 site of initiation 
of translation is used instead of the Ml site. If promoter 3 is 
used and exon 3, which lacks an in-frame ATG codon, is joined 
to exon 5 (type 5 in Fig. 6, A and £>), the cytosolic isoform is 
produced from the ATG initiation codon in exon 5. The use of 
promoter 2, which directs the synthesis of a cytosolic variant, is 
discussed below. 

Similar senarios have been reported for several other en- 
zymes that are known to occur and function in more than one 
subcellular compartment (for a recent review, see Ref. 47). 
Topical examples are the histidine and valine tRNA syntheta- 
ses of Saccharomyces cerevisiae that are involved in protein 
synthesis in the cytosol and the mitochondria (48, 49). In these 
cases, two types of transcript (long and short) are produced by 
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DNA probes are shown on the right B, the sequence of the 837-bp ^'^J^ of 67%. Nine binding 

of initiation of transcription are indicated by arrows Exon 1 andexon 2 are .located '^^^^^^^ ^nt exons and the 
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alternative usage of promoters, and the long transcript yields 
the mitochondrial isoform exclusively, while the short tran- 
script yields the cytosolic enzyme. In this way, adjustment of 
the levels of the proteins to the needs of each compartment is 
possible. Although the biological significance of this mecha- 
nism in the present case is not immediately apparent since the 
true substrates of the enzyme have not yet been identified, the 
general regulation of expression of the gene for an oligopepti- 
dase by transcription regulatory factors and the unique regu- 
lation, as reported herein, of targeting of the product by use of 
alternative promoters seems to provide a powerful method by 
which cells can modulate the concentration of specific peptides 
in certain intracellular compartments to reflect the metabolic 
state of the cell. 

Complex Organization of the Pig Gene for EP 24.16 
(MOPh-bs compared with the gene for EP 24.15 (TOP), an- 
other member of the thimet oligopeptidase family, the gene for 
EP 24.16 (MOP) have quite a complicated structure in its 5'- 
and 3 '-regions. The two genes do, however, exhibit extensive 



similarity in the regions that encode the mature proteins, 
which consist of 11 exons, namely, exons 2 through 12 in the 
case of the gene for EP 24.15 (TOP) and exons 5 through 15 in 
the case of the gene for EP 24.16 (MOP). The similarity sug- 
gests that these two genes and, probably, the genes for other 
members of this family were generated from an ancestral gene 
as distinct sequences as a consequence of gene duplication. The 
presence of a SINE in the 3'-untranslated region of the gene for 
EP 24.16 (MOP) suggests that insertion of a SINE after the 
gene duplication might have destabilized the gene for EP 24.16 
(MOP) and stimulated extensive diversification of 5'- and 3'- 
regions by recruiting the entire gene for ribosomal protein L44 
(in the reverse orientation) into the 3'-most exon (exon 16) and 
the 5'-leader exons into the 5'-flanking region by, perhaps, 
retroposition and gene conversion. 

Long and Short Forms ofEP 24.16 (MOP)— The use of pro- 
moter 2 of the gene for EP 24.16 (MOP) yields the type 2 
transcript, which is predicted to have an amino-terminally 
elongated product (Fig. 7). Consistent with this prediction, we 
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detected a long form of the protein in the cytosol of COS-1 cells 
transfected with the type 2 construct after SDS-PAGE and 
Western blotting (Fig. 6B, lane 10). Although the relative abun- 
dance of the corresponding mRNA, as estimated from the data 
after 5 '-RACE, in the pig liver is low (<10%), the physiological 
significance of this form clearly merits further study. The roles 
of the extended amino- terminal region of 64 amino acid resi- 
dues (Fig. 6DJ could include stabilization of the enzyme, mod- 
ulation. of the substrate specificity, and/or mediation of inter- 
actions with other cytosolic proteins. 

In our analysis, we also noticed the presence of a splice 
variant that lacked the sequence of exon 15. 2 This variant 
should encode a protein with a short and slightly different 
carboxyl-terminal tail. The functional significance of this var- 
iant and the tissue- and development-specific regulation of the 
splicing will be the subject of further research. The presence of 
at least two forms of EP 24.16 (MOP) has also been demon- 
strated in purified preparations of the enzyme from rabbit and 
pig liver (50, 51). 

The type 1, type 2, and type 3 species of mRNA all have splice 
variants, designated type 1', type 2', and type 3', respectively, 
with an extra exon sequence (5 '-untranslated exon 4, Fig. 7), 
but none of them results in a long open reading frame from the 
first initiation codon. It is unknown therefore, whether these 
variants have any biological significance. However, the possi- 
bility exists that the variant species of mRNA might contribute 
to the regulation of rates of translation. Alternatively, they 
might produce short peptides with as yet unidentified func- 
tions. In the case of the type 1' transcript, the insertion of the 
sequence of exon 4 provides a mechanism by which the synthe- 
sis of the mitochondrial isoform is suppressed even under con- 
ditions under which promoter 1 is active but there is no mito- 
chondrial requirement for the oligopeptidase. 

Conclusion— Analysis of the gene for EP 24.16 (MOP) re- 
vealed the very complex organization of the gene and the pres- 
ence of a variety of transcripts generated by differential use of 
multiple sites of initiation of transcription and by alternative 
splicing of exons 2, 3, 4, and 15. In contrast to these complex- 
ities, a simple and definitive answer was obtained to the ques- 
tion of how the product of a single gene for EP 24.16 (MOP) is 
delivered to two different cellular compartments, namely, the 
cytosol and the mitochondria. 
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